A Parallel Algorithm for Estimating Genome-Wide Gene Networks using Nonparametric Bayesian Networks

نویسندگان

  • Yoshinori Tamada
  • Seiya Imoto
  • Hiromitsu Araki
  • Masao Nagasaki
  • Satoru Miyano
چکیده

We present a novel algorithm for estimating genome-wide gene networks using nonparametric Bayesian network models [3]. The algorithm, which is called the Neighbor Node Sampling & Repeat (NNSR) algorithm, is capable of searching a Bayesian network structure consisting of more than 20 000 nodes, which is fitted to given gene expression data. To realize the large scale Bayesian network structure search, the algorithm is designed to run on massively parallel computers where a massive amount of independent computation nodes are linked by fast connection. Such a computer system is also known as a distributed-memory architecture supercomputer. A Bayesian network is widely used as a gene network model [2]. Learning of a Bayesian network structure from gene expression data, however, is known as an NP-hard problem, and therefore the optimal network structure can be estimated only up to less than 30 genes or fewer. Thus, for the larger network, a heuristics algorithm such as the greedy hill-climbing (HC) algorithm is often used. The HC algorithm is applicable to networks with up to 1000 genes. A gene network with 1000 genes, however, involves fewer than 5% of all human genes. Therefore, the current Bayesian network estimation technology is far from a genome-wide scale analysis for most species.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Load-Frequency Control: a GA based Bayesian Networks Multi-agent System

Bayesian Networks (BN) provides a robust probabilistic method of reasoning under uncertainty. They have been successfully applied in a variety of real-world tasks but they have received little attention in the area of load-frequency control (LFC). In practice, LFC systems use proportional-integral controllers. However since these controllers are designed using a linear model, the nonlinearities...

متن کامل

Estimation of Products Final Price Using Bayesian Analysis Generalized Poisson Model and Artificial Neural Networks

Estimating the final price of products is of great importance. For manufacturing companies proposing a final price is only possible after the design process over. These companies propose an approximate initial price of the required products to the customers for which some of time and money is required. Here using the existing data of already designed transformers and utilizing the bayesian anal...

متن کامل

A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf

Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation  method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...

متن کامل

A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf

Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation  method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...

متن کامل

Estimating gene regulatory networks and protein-protein interactions of Saccharomyces cerevisiae from multiple genome-wide data

MOTIVATION Biological processes in cells are properly performed by gene regulations, signal transductions and interactions between proteins. To understand such molecular networks, we propose a statistical method to estimate gene regulatory networks and protein-protein interaction networks simultaneously from DNA microarray data, protein-protein interaction data and other genome-wide data. RES...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009